Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval

نویسندگان

  • Yu Zhang
  • Wei-Nan Zhang
  • Ke Lu
  • Rongrong Ji
  • Fanglin Wang
  • Ting Liu
چکیده

Lexical gap in cQA search, resulted by the variability of languages, has been recognized as an important and widespread phenomenon. To address the problem, this paper presents a question reformulation scheme to enhance the question retrieval model by fully exploring the intelligence of paraphrase in phrase-level. It compensates for the existing paraphrasing research in a suitable granularity, which either falls into fine-grained lexical-level or coarse-grained sentence-level. Given a question in natural language, our scheme first detects the involved key-phrases by jointly integrating the corpus-dependent knowledge and question-aware cues. Next, it automatically extracts the paraphrases for each identified key-phrase utilizing multiple online translation engines, and then selects the most relevant reformulations from a large group of question rewrites, which is formed by full permutation and combination of the generated paraphrases. Extensive evaluations on a real world data set demonstrate that our model is able to characterize the complex questions and achieves promising performance as compared to the state-of-the-art methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interrogative Reformulation Patterns and Acquisition of Question Paraphrases

We describe a set of paraphrase patterns for questions which we derived from a corpus of questions, and report the result of using them in the automatic recognition of question paraphrases. The aim of our paraphrase patterns is to factor out different syntactic variations of interrogative words, since the interrogative part of a question adds a syntactic superstructure on the sentence part (i.e...

متن کامل

Exploring Key Concept Paraphrasing Based on Pivot Language Translation for Question Retrieval

Question retrieval in current community-based question answering (CQA) services does not, in general, work well for long and complex queries. One of the main difficulties lies in the word mismatch between queries and candidate questions. Existing solutions try to expand the queries at word level, but they usually fail to consider concept level enrichment. In this paper, we explore a pivot langu...

متن کامل

iSTART: Paraphrase Recognition

Paraphrase recognition is used in a number of applications such as tutoring systems, question answering systems, and information retrieval systems. The context of our research is the iSTART reading strategy trainer for science texts, which needs to understand and recognize the trainee’s input and respond appropriately. This paper describes the motivation for paraphrase recognition and develops ...

متن کامل

Retrieval Models and Q and A Learning with FAQ Files

The idea of compiling knowledge into FAQ files has existed for some time. The Usenet became an early repository of on-line FAQ files, and currently the Internet FAQ Archives web site (http://www.faqs.org) has 2490 " popular " FAQs archived. There are many other sources of FAQ files. Call-center manuals are also often structured as FAQ files. As the world wide web has become widely accessible, F...

متن کامل

iSTART: Paraphrase Recognition

Paraphrase recognition is used in a number of applications such as tutoring systems, question answering systems, and information retrieval systems. The context of our research is the iSTART reading strategy trainer for science texts, which needs to understand and recognize the trainee’s input and respond appropriately. This paper describes the motivation for paraphrase recognition and develops ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013